A new method for multi-oriented graphics-scene-3D text classification in video
Abstract
Text detection and recognition in video is challenging due to the presence of different types of text, namely graphics (video caption), scene (natural text), 2D, 3D, static and dynamic text. Developing a universal method that works well for all these types is hard. In this paper, we propose a novel method for classifying graphics-scene and 2D-3D texts in video to enhance text detection and recognition accuracy. We first propose an iterative method to classify static text and dynamic text clusters based on the fact that static texts have zero velocity while dynamic texts do not. This results in text candidates for both static and dynamic texts regardless of 2D and 3D types. We then propose symmetry detection for text candidates using stroke width distances and medial axis values. This process gives rise to potential text candidates. We group potential text candidates using their geometrical properties to form text regions. Next, for each text region, we study the distribution of dominant medial axis values given by the ring radius transform in a new way to classify graphics and scene texts. Similarly, we study the proximity among the pixels that satisfy gradient direction symmetry to classify 2D and 3D texts. We evaluate each step of the proposed method in terms of classification and recognition rates in comparison with several existing methods to show that video text classification is effective and necessary for enhancing the capability of current text detection and recognition systems.
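The zero-velocity criterion mentioned in the abstract can be illustrated with a small sketch. This is not the authors' iterative clustering method; it only shows the underlying idea, assuming each text candidate has already been tracked as a list of (x, y) centroids across frames. The names `mean_speed` and `classify_tracks`, the tolerance `tol`, and the sample tracks are all hypothetical.

```python
from math import hypot


def mean_speed(track):
    """Average centroid displacement per frame for a list of (x, y) points."""
    if len(track) < 2:
        return 0.0
    steps = [
        hypot(x1 - x0, y1 - y0)
        for (x0, y0), (x1, y1) in zip(track, track[1:])
    ]
    return sum(steps) / len(steps)


def classify_tracks(tracks, tol=0.5):
    """Label each track 'static' or 'dynamic'.

    tol is a hypothetical tolerance in pixels per frame; it absorbs
    tracking jitter so near-zero velocity still counts as static.
    """
    return {
        name: "static" if mean_speed(track) <= tol else "dynamic"
        for name, track in tracks.items()
    }


# Hypothetical centroid tracks over three consecutive frames.
tracks = {
    "caption": [(100, 400), (100, 400), (100, 400)],
    "scrolling": [(300, 50), (296, 50), (292, 50)],
}
labels = classify_tracks(tracks)
```

In this toy input the caption never moves, so its mean speed is zero, while the scrolling text shifts four pixels per frame and exceeds the tolerance.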
Similar resources
on Pattern Recognition 2D and 3D Video Scene Text Classification
Text detection and recognition is a challenging problem in document analysis due to the presence of the unpredictable nature of video texts, such as the variations of orientation, font and size, ... methods degrades drastically [4,5] because of the variations in edge pattern and strength. For instance, in Figure 1, (a) shows 2D characters chosen from video, (b) shows a 3D character from video but i...
Fast Intra Mode Decision for Depth Map Coding in 3D-HEVC Standard
Three-dimensional high efficiency video coding (3D-HEVC) is the extended version of the latest video compression standard, namely high efficiency video coding (HEVC), and is used to compress 3D videos. 3D videos include texture video and depth maps. Since the statistical characteristics of depth maps are different from those of texture videos, new tools have been added to the HEVC standard fo...
3D Scene and Object Classification Based on Information Complexity of Depth Data
In this paper the problem of 3D scene and object classification from depth data is addressed. In contrast to high-dimensional feature-based representation, the depth data is described in a low dimensional space. In order to remedy the curse of dimensionality problem, the depth data is described by a sparse model over a learned dictionary. Exploiting the algorithmic information theory, a new def...
Development of MPEG Standards for 3D and Free Viewpoint Video
An overview of 3D and free viewpoint video is given in this paper with special focus on related standardization activities in MPEG. Free viewpoint video allows the user to freely navigate within real world visual scenes, as known from virtual worlds in computer graphics. Suitable 3D scene representation formats are classified and the processing chain is explained. Examples are shown for image-b...
A Skeleton-Based Method for Multi-Oriented Text Detection
In this paper, we propose a method based on the skeletonization operation for multi-oriented text detection. The first step uses our existing Laplacian-based method to identify candidate text regions. In the second step, each region is classified as either a simple connected component (a single text string) or a complex connected component (multiple text strings that are connected to each other...
Journal title:
- Pattern Recognition
Volume 49, Issue
Pages -
Publication date 2016